CTIVA: Censored time interval variable analysis

Traditionally, datasets with multiple censored time-to-events have not been utilized in multivariate analysis because of their high level of complexity. In this paper, we propose the Censored Time Interval Variable Analysis (CTIVA) method to address this issue. It estimates the joint probability distribution of the actual event times in the censored dataset by applying a statistical probability density estimation technique to the dataset. Based on the estimated event times, CTIVA identifies variables correlated with the interval time between events via statistical tests. The proposed method handles both categorical and continuous variables simultaneously; thus, it is suitable for real-world censored time-to-event datasets, which include both types of variables. CTIVA outperforms traditional censored time-to-event data handling methods by 5% on simulation data. The average area under the curve (AUC) of the proposed method on the simulation dataset exceeds 0.9 under various conditions. Further, CTIVA yields novel results on the National Sample Cohort Demo (NSCD) dataset, a real-world censored time-to-event dataset of the medical history of beneficiaries provided by the National Health Insurance Sharing Service (NHISS), and on the proteasome inhibitor bortezomib dataset provided by the National Center for Biotechnology Information (NCBI). We believe that the development of CTIVA is a milestone in the investigation of variables correlated with the interval time of events in the presence of censoring.


Introduction
Discovering significant variables related to particular phenomena via multivariate analysis is an important task in several academic fields. In particular, identifying causal relationships between clinical events and gene expression is a major problem in bioinformatics. However, clinical events are censored in most gene expression datasets, hindering the evaluation of relationships between gene expression data and clinical events. Although several methods have been proposed to handle censored datasets [1][2][3][4][5], handling samples with multiple censored events remains a complicated task [6].
Development of a method capable of handling multiple censored datasets could provide an effective solution to several problems in multiple fields. For instance, it would be useful for identifying a gene related to the survival time of patients with carcinoma: the gene can be denoted by a variable, the detection of carcinoma can be considered event 1, and death can be considered event 2. In such a case, the data corresponding to both event 1 and event 2 may be censored. Although survival analysis models that handle multiple events exist, such as the competing risk regression model, they only handle correlated events [7]. Therefore, methods that handle multiple independent censored events should be further investigated.
Although previous studies have handled multiple independent censored events, most existing works only address problems with continuous variables [8]. Consequently, the importance of methods that manage multiple censored datasets while accommodating various types of variables, including categorical and continuous variables, has increased. However, despite their apparent utility, the complexity arising from multiple censoring has hindered their further development.
Traditionally, two dominant approaches have been developed to handle censored data: statistical analysis methods [1,2,9] and deep learning methods [10][11][12][13]. Although deep neural network-based methods have been demonstrated to be effective on censored datasets, the inaccessibility and scarcity of multiple censored datasets limit the viability of deep learning-based methods [14][15][16]. In this study, we propose a statistical density estimation method for multiple censored data analysis, called censored time interval variable analysis (CTIVA).
The proposed method handles multiple censored data by estimating the density function of the dataset and reconstructing the censored events from the estimated density functions. This statistical approach is intended to cope with the inaccessibility and scarcity of most censored datasets. Moreover, CTIVA demonstrates practical advantages over previous methods. It addresses the challenges posed by diverse variable types, a facet not adequately tackled by prior studies. Specifically, the proposed method can be applied to clinical trial data and symptoms, which can be treated as categorical variables. Thus, the proposed CTIVA extends the analysis of multivariate censored time-to-event datasets to more general fields.

Censored time-to-event data analysis and data generation
The proposed CTIVA method detects variables correlated with the interval time of events in the presence of censoring. It is a generalized version of censored time-to-event data-handling methods, which are usually utilized for genomic data analysis [6]. Formally, for each sample $i$, let us denote the true occurrence times of events 1 and 2 by $T_{i1}$ and $T_{i2}$, respectively. Additionally, the censoring times of events 1 and 2 are denoted by $C_{i1}$ and $C_{i2}$, respectively. In the presence of censoring, the observed time points, $V_{i1}$ and $V_{i2}$, and the censoring indicators, $\Delta_{i1}$ and $\Delta_{i2}$, are observed instead of $T_{i1}$, $T_{i2}$, $C_{i1}$, and $C_{i2}$. The observed time points and indicators are defined as follows: $V_{i1} = \min(T_{i1}, C_{i1})$, $V_{i2} = \min(T_{i2}, C_{i2})$, $\Delta_{i1} = I(T_{i1} \le C_{i1})$, and $\Delta_{i2} = I(T_{i2} \le C_{i2})$, where $I(\cdot)$ denotes an indicator function. CTIVA is proposed to detect variables that are significantly associated with the true interval time, $T_2 - T_1$, based on the observation data $\{V_1, \Delta_1, V_2, \Delta_2\}$ of $n$ samples.
To acquire a distinctive dataset for the experiment, simulation data are generated using the following procedure. The two actual event times, $T_1$ and $T_2$, are sampled from a chosen probability density function. Subsequently, the two censoring times, $C_1$ and $C_2$, are sampled independently from probability density functions similar to those from which the actual event times are sampled.
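The generation procedure can be illustrated with a minimal sketch. The exponential model used here (event 2 follows event 1 after an independent exponential gap, with censoring times built the same way) and the names `simulate_censored_pairs`, `event_rate`, and `censor_rate` are illustrative assumptions rather than the exact densities of Table 1.

```python
import random

def simulate_censored_pairs(n, event_rate=1.0, censor_rate=0.5, seed=0):
    """Generate n samples of (v1, d1, v2, d2, t1, t2).

    Assumed model (illustrative only): T1 ~ Exp(event_rate) and
    T2 = T1 + Exp(event_rate); C1 and C2 are built the same way from
    independent draws with rate censor_rate.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        # True event times.
        t1 = rng.expovariate(event_rate)
        t2 = t1 + rng.expovariate(event_rate)
        # Censoring times, drawn independently of the event times.
        c1 = rng.expovariate(censor_rate)
        c2 = c1 + rng.expovariate(censor_rate)
        # The observed time is the earlier of the event and censoring times.
        v1, v2 = min(t1, c1), min(t2, c2)
        # The indicator is 1 when the true event time is observed.
        d1, d2 = int(t1 <= c1), int(t2 <= c2)
        samples.append((v1, d1, v2, d2, t1, t2))
    return samples

data = simulate_censored_pairs(500)
```

In a real analysis only the first four entries of each tuple would be available; the true times are returned here so that an estimate can be compared against the ground truth.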
To handle the censored time-to-event data, the proposed CTIVA estimates the joint distribution of the event times statistically, calculates the conditional expectation of $T_2 - T_1$ given the observed $\{V_1, \Delta_1, V_2, \Delta_2\}$ using Monte Carlo simulation, and, finally, identifies significant variables based on statistical tests. The detailed procedure of CTIVA is described below.

Joint probability density estimation
Owing to the presence of censoring in the observation dataset, most of the true interval time values cannot be acquired from it. Hence, $f_{T_1,T_2}$, the joint probability distribution of $T_1$ and $T_2$, is estimated from the observed $\{V_1, \Delta_1, V_2, \Delta_2\}$ by implementing the multivariate survival analysis based on optional Polya tree (OPT) Bayesian estimation [17,18]. Although a general p-dimensional multivariate problem is handled in the previous studies, the problem is simplified to a two-dimensional bivariate problem in CTIVA for computational efficiency. The simplified method implemented in CTIVA is explained below; the detailed procedure of the original method is described in Seok et al. [18].
CTIVA utilizes an OPT to estimate the joint distribution. The OPT is characterized by a likelihood function $F(A)$ for a region $A$ in the sample space $\Omega$. $F(A)$ is calculated recursively using $F(A_{11})$, $F(A_{12})$, $F(A_{21})$, and $F(A_{22})$, where $A_{ij}$ is defined to be the $j$-th subregion obtained by splitting $A$ at the center point of the $T_i$ axis. Formally, $F(A)$ is calculated through a recursive equation over these subregions.
Here, $F_0(A)$ is a baseline likelihood value that assumes all sample points in $A$ follow the uniform distribution, $B(\cdot)$ denotes the beta function, and $N(A)$ denotes the number of samples in region $A$.
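For reference, the recursion can be sketched in the form of the standard optional Polya tree recursion. The constants below are assumptions: a stopping probability of 1/2 and equal weight on the two split directions are not stated here, so the exact coefficients may differ from those used in [18]. The symbol $|A|$ denotes the area of $A$.

```latex
F(A) = \frac{1}{2}\, F_0(A)
     + \frac{1}{4} \sum_{i=1}^{2}
       \frac{B\big(N(A_{i1}) + 0.5,\; N(A_{i2}) + 0.5\big)}{B(0.5,\, 0.5)}\,
       F(A_{i1})\, F(A_{i2}),
\qquad
F_0(A) = |A|^{-N(A)}
```

Under these assumptions, the first term is the likelihood of stopping at $A$ with a uniform density, and each term of the sum is the likelihood of splitting $A$ along the $T_i$ axis and continuing the recursion in the two resulting subregions.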
If $N(A) < 2$, $F(A) = F_0(A)$. Through repeated recursive calculation and binary splitting, the values of $F(A)$ corresponding to all subregions of $\Omega$ can be obtained. Based on the obtained $F(A)$ values, CTIVA calculates the joint distribution of $T_1$ and $T_2$ by following the steps below.
For an arbitrary subregion $A$, a criterion involving the ratio $\frac{B(N(A_{i1})+0.5,\, N(A_{i2})+0.5)}{B(0.5,\, 0.5)}$ determines whether $A$ is considered to follow a uniform distribution. The probability density of a subregion $A$ that follows the uniform distribution can be calculated as $\frac{N(A)}{n|A|}$, where $n$ is the total number of observed samples and $|A|$ is the area of $A$. Otherwise, $A$ is split further. CTIVA performs this task recursively until all partitions of $\Omega$ are considered to follow the uniform distribution. Through this process, the probability density of each partition is obtained based on the number of samples in each region.
However, the number of samples in region $A$, $N(A)$, cannot be obtained by counting the observations in the partition because of the missing observations caused by censoring. In the presence of censoring, $N(A)$ is indirectly estimated from the joint distribution $f_{T_1,T_2}$. For a given joint distribution $f$, we define $N(A|f)$ to be the estimated number of samples in $A$. Since the $N(A)$ values are required for the OPT calculation, $f_{T_1,T_2}$ is obtained by substituting $N(A|f)$ for $N(A)$, which gives the self-consistency equation $f_{T_1,T_2} = \mathrm{OPT}(N(A \mid f_{T_1,T_2}))$.
To solve the equation above, an iterative approach with $f^{(i+1)}_{T_1,T_2} = \mathrm{OPT}(N(A \mid f^{(i)}_{T_1,T_2}))$ is implemented. The proposed CTIVA method obtains the final joint distribution $f_{T_1,T_2}$ by repeating the iteration until $f^{(i)}_{T_1,T_2}$ converges. The initial distribution required for the iteration is obtained from the initial estimate of $N(A)$. $N^{(0)}(A)$ is estimated from $T_1$ and $T_2$, assuming the two distributions to be independent in subregion $A$. Univariate Kaplan-Meier estimators are used for the initial estimation of $T_1$ and $T_2$ in region $A$. Finally, the initial joint probability distribution is given by $f^{(1)}_{T_1,T_2} = \mathrm{OPT}(N(A \mid f^{(0)}_{T_1,T_2}))$. All of the functionally notated values in the OPT algorithm are discrete, which implies that the algorithm usually converges.
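The structure of this fixed-point iteration can be sketched on a fixed grid. This is a simplified illustration, not the OPT procedure: a plain histogram (cell counts divided by $n$) stands in for the OPT step, the start is a flat distribution instead of the Kaplan-Meier initialization, and a censored time is treated as lying anywhere from its own grid cell onward. The names `cell_index`, `expected_counts`, and `fit_joint` are hypothetical.

```python
def cell_index(value, edges):
    """Index of the grid cell [edges[k], edges[k+1]) containing value (clamped)."""
    for k in range(len(edges) - 1):
        if value < edges[k + 1]:
            return k
    return len(edges) - 2

def expected_counts(obs, f, edges):
    """N(A | f) for every grid cell A: each sample spreads one unit of mass
    over the cells compatible with its observation, in proportion to f."""
    m = len(edges) - 1
    counts = [[0.0] * m for _ in range(m)]
    for v1, d1, v2, d2 in obs:
        k1, k2 = cell_index(v1, edges), cell_index(v2, edges)
        rows = [k1] if d1 else list(range(k1, m))  # censored: T1 at or beyond V1
        cols = [k2] if d2 else list(range(k2, m))  # censored: T2 at or beyond V2
        cells = [(a, b) for a in rows for b in cols]
        total = sum(f[a][b] for a, b in cells)
        for a, b in cells:
            # Fall back to equal shares if f gives no mass to compatible cells.
            counts[a][b] += f[a][b] / total if total > 0 else 1.0 / len(cells)
    return counts

def fit_joint(obs, edges, tol=1e-8, max_iter=500):
    """Iterate f <- estimate(N(A | f)) until the cell probabilities stabilize."""
    m, n = len(edges) - 1, len(obs)
    f = [[1.0 / (m * m)] * m for _ in range(m)]  # flat start (assumption)
    for _ in range(max_iter):
        counts = expected_counts(obs, f, edges)
        new_f = [[c / n for c in row] for row in counts]  # histogram stand-in for OPT
        change = max(abs(new_f[a][b] - f[a][b]) for a in range(m) for b in range(m))
        f = new_f
        if change < tol:
            break
    return f
```

Here `f[a][b]` is the probability mass of the cell whose $T_1$ range is `[edges[a], edges[a+1])` and whose $T_2$ range is `[edges[b], edges[b+1])`; dividing by the cell area would give a density.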

Time interval estimation
Using the estimated joint distribution $f_{T_1,T_2}$ obtained via the aforementioned steps, the conditional distribution of $T_1$ and $T_2$ can be calculated when the observations of sample $i$ are given. The given observation $\{V_{i1}, \Delta_{i1}, V_{i2}, \Delta_{i2}\}$ can be classified into four cases. In the first case, $\Delta_{i1} = 1$ and $\Delta_{i2} = 1$, and censoring is absent. Therefore, $T_{i1} = V_{i1}$ and $T_{i2} = V_{i2}$. The second and third cases involve single censoring. In the second case, $\Delta_{i2} = 0$, and, in the third case, $\Delta_{i1} = 0$. Thus, the uncensored event time is known exactly, and the conditional density of the censored event time, restricted to values beyond its observed time point, is obtained from $f_{T_1,T_2}$. In the final case, censoring exists for both events. Thus, $\Delta_{i1} = 0$ and $\Delta_{i2} = 0$, and the conditional density of $(T_1, T_2)$ on the region $T_1 > V_{i1}$, $T_2 > V_{i2}$ is obtained. After estimating the density functions in the four cases, the conditional expectation $E[T_2 - T_1 \mid V_{i1}, \Delta_{i1}, V_{i2}, \Delta_{i2}]$ is estimated empirically via Monte Carlo simulation, because the analytical calculation of the expectation is complicated and inaccurate owing to the random sampling in the OPT algorithm when censoring exists. Monte Carlo simulation also reduces the influence of this random sampling and saves computational resources. From the obtained conditional distributions of $T_1$ and $T_2$, pairs $(T_1, T_2)$ are randomly sampled. The empirical expectation is calculated as the mean of the interval $T_2 - T_1$ over the generated pairs.
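The four cases and the Monte Carlo average can be sketched as follows. This is a minimal illustration that assumes the joint distribution is available as cell probabilities `f` on a grid with boundaries `edges`; the function name `expected_interval` and the uniform sampling inside a grid cell are assumptions, not part of the original method.

```python
import random

def expected_interval(f, edges, v1, d1, v2, d2, n_draws=2000, rng=None):
    """Monte Carlo estimate of E[T2 - T1 | V1, D1, V2, D2] on a grid.

    f[a][b] is the probability mass of the cell with T1 in
    [edges[a], edges[a+1]) and T2 in [edges[b], edges[b+1]).
    """
    rng = rng or random.Random(0)
    if d1 and d2:
        return v2 - v1  # case 1: both event times are observed exactly
    m = len(edges) - 1

    def cell_of(v):
        for k in range(m):
            if v < edges[k + 1]:
                return k
        return m - 1

    k1, k2 = cell_of(v1), cell_of(v2)
    rows = [k1] if d1 else list(range(k1, m))  # cells compatible with T1
    cols = [k2] if d2 else list(range(k2, m))  # cells compatible with T2
    cells = [(a, b) for a in rows for b in cols]
    weights = [f[a][b] for a, b in cells]
    if sum(weights) <= 0:
        weights = [1.0] * len(cells)  # no mass on compatible cells: equal weights
    total = 0.0
    for _ in range(n_draws):
        a, b = rng.choices(cells, weights=weights)[0]
        # Observed times are kept; censored times are drawn uniformly inside
        # the sampled cell, but not before the censoring point.
        t1 = v1 if d1 else rng.uniform(max(edges[a], v1), edges[a + 1])
        t2 = v2 if d2 else rng.uniform(max(edges[b], v2), edges[b + 1])
        total += t2 - t1
    return total / n_draws
```

The returned value plays the role of the expected interval time of one sample; repeating the call for every sample yields one estimate per sample for the subsequent statistical tests.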
Let $x_{ij}$ be the observation of the $j$-th variable, $x_j$, in sample $i$, and let $y_i$ be the expected interval time obtained via Monte Carlo simulation. Based on the pairs $(x_{ij}, y_i)$ for $i = 1, 2, \ldots, n$, the statistical relationship between the variable $x_j$ and $T_2 - T_1$ can be estimated via several statistical methods, such as analysis of variance (ANOVA), permutation tests, and rank correlation tests.
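As an illustration of one such test, the sketch below implements a permutation test for a categorical variable. The choice of test statistic (the between-group sum of squares, as in one-way ANOVA) and the names `between_group_ss` and `permutation_p_value` are assumptions made for this example; the original implementation may use a different statistic.

```python
import random

def between_group_ss(x, y):
    """Between-group sum of squares of y across the categories in x."""
    grand_mean = sum(y) / len(y)
    groups = {}
    for xi, yi in zip(x, y):
        groups.setdefault(xi, []).append(yi)
    return sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2
               for g in groups.values())

def permutation_p_value(x, y, n_perm=2000, seed=0):
    """P-value for 'x is unrelated to y', obtained by shuffling y."""
    rng = random.Random(seed)
    observed = between_group_ss(x, y)
    shuffled = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if between_group_ss(x, shuffled) >= observed:
            hits += 1
    # Add-one correction keeps the estimate away from an exact zero.
    return (hits + 1) / (n_perm + 1)
```

Applied to one variable at a time, with `x` holding the category of each sample and `y` the expected interval times, this yields one p-value per variable.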

Simulation settings
The statistical and stochastic approach of CTIVA to handling censored datasets has been demonstrated to be effective in previous studies. However, directly demonstrating the effectiveness of the proposed method on real-world problems is difficult owing to the poor accessibility of major censored datasets. Therefore, we first investigate the performance of the proposed CTIVA method using simulated examples before testing it on real-world problems.
Initially, a bivariate censored time-to-event dataset comprising 1,000 categorical variables is randomly generated. The data are randomly sampled from three different probability density functions: the additive exponential distribution, the log-normal distribution, and the Clayton-Oakes model [19]. Each dataset consists of 500 generated samples, and the experiment is repeated 100 times using the identical generation procedure. A detailed description of the probability density functions is provided in Table 1.
Variables sampled from the three different distributions have distinct characteristics. The variables sampled from the additive exponential distribution have a monotonic hazard function, while the variables sampled from the log-normal distribution have a non-monotonic hazard function. Unlike the first two, variables sampled from the Clayton-Oakes model have an inherent dependency between the two event times [20].
Out of the 1,000 variables, 100 were generated to be correlated with the interval time between events 1 and 2, another 100 were generated to be correlated with the occurrence time of event 1, another 100 were generated to be correlated with the occurrence time of event 2, and the remaining 700 were generated independently of the events. The detailed generation procedure for the dataset is summarized in Table 1. The proposed CTIVA was designed to detect the 100 variables that were correlated with the interval time among the 1,000 variables. A variable was declared correlated when its statistical test yielded a p-value below a threshold. Since the variable detection results depend on this threshold, the Receiver Operating Characteristic (ROC) curves and Area Under the Curve (AUC) values are obtained by varying the p-value threshold.
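The AUC obtained by sweeping the p-value threshold can be computed without drawing the curve explicitly, because it equals the probability that a truly correlated variable receives a smaller p-value than an uncorrelated one, with ties counted as one half. The sketch below uses this equivalence; the function name `auc_from_p_values` is hypothetical.

```python
def auc_from_p_values(p_values, is_correlated):
    """AUC of the ROC curve traced by sweeping a p-value threshold.

    p_values[j] is the p-value of variable j, and is_correlated[j] is True
    when variable j was generated to be correlated with the interval time.
    """
    pos = [p for p, t in zip(p_values, is_correlated) if t]
    neg = [p for p, t in zip(p_values, is_correlated) if not t]
    wins = 0.0
    for p in pos:
        for q in neg:
            if p < q:
                wins += 1.0   # correlated variable ranked ahead
            elif p == q:
                wins += 0.5   # tie counts as half
    return wins / (len(pos) * len(neg))

# Two correlated and two uncorrelated variables; one pair is ranked wrongly.
print(auc_from_p_values([0.2, 0.01, 0.03, 0.5], [True, True, False, False]))  # 0.75
```

In the simulation setting above, `is_correlated` would mark the 100 variables generated to be correlated with the interval time among the 1,000 variables.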
The proposed CTIVA method was compared with various baseline methods for performance verification. The baseline methods include two naïve methods that do not account for censoring: one ignores the censoring, and the other considers only the samples without censoring. Specifically, the naïve method that ignores censoring treats the censored time as the actual observed event time. Furthermore, we also conducted the experiment with the Cox regression model [3], the method that is predominantly used for handling censored time-to-event data.
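The two naïve baselines can be written in a few lines. The sketch below assumes each observation is a tuple `(v1, d1, v2, d2)` of observed times and censoring indicators; the function names are hypothetical, and their abbreviations follow the IGN and NC labels used in Fig 1.

```python
def naive_ignore_censoring(obs):
    """IGN: treat every observed time as if it were the true event time."""
    return [v2 - v1 for v1, d1, v2, d2 in obs]

def naive_non_censored_only(obs):
    """NC: keep only samples in which both events are uncensored.

    Returns (sample index, interval) pairs so that the matching variable
    values can be selected for the statistical test.
    """
    return [(i, v2 - v1)
            for i, (v1, d1, v2, d2) in enumerate(obs)
            if d1 == 1 and d2 == 1]

obs = [(1.0, 1, 3.0, 1), (2.0, 0, 2.5, 1), (0.5, 1, 4.0, 0)]
print(naive_ignore_censoring(obs))   # [2.0, 0.5, 3.5]
print(naive_non_censored_only(obs))  # [(0, 2.0)]
```

The first baseline keeps all samples but uses biased intervals, whereas the second uses exact intervals but discards every sample that has any censoring.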

Simulation results in uncorrelated dataset
The effectiveness of the proposed method in solving real-world problems is demonstrated by applying it to combined variable problems, as real-world problems usually involve both categorical and continuous variables. To generate the combined variable problem, a bivariate censored time-to-event dataset comprising 500 categorical variables and 500 continuous variables is randomly generated. Time-to-events are randomly sampled from the three density functions listed in Table 1. As in the case of the dataset with 1,000 categorical variables, 100 variables, half continuous and half categorical, are correlated with the interval time between events 1 and 2. The other variables exhibit similar degrees of correlation as in the case of the dataset with 1,000 categorical variables, except that half of them are continuous. The same statistical tests are performed on the combined dataset, except that linear-regression-based p-value estimation is used in place of the ANOVA test for continuous variables.

Table 1 notes: $T$ refers to the actual event time of the generated data and $C$ refers to the censored time point of the data. $N(\mu_T, \Sigma_T)$ refers to the Gaussian distribution with $\mu_T$ as the set of means and $\Sigma_T$ as the set of covariances.
Both the ROC curve and AUC values are acquired on the combined dataset. The simulation is conducted using the same number of samples and repetitions as in the case of the dataset with 1,000 categorical variables. Although the tests implemented for categorical and continuous variables are different, the same p-value threshold is used to calculate the ROC curve and AUC value during the evaluation of the method. The mathematical basis for using equal thresholds might be insufficient; nevertheless, the experiments are designed to demonstrate the effectiveness of the proposed method on real-world problems, which usually involve both categorical and continuous variables.
The proposed CTIVA method exhibits excellent prediction performance for both categorical and combined variables that are correlated with the interval time. The performance of CTIVA is compared with the Cox survival analysis model [1], which only considers a single censored time event, and with two naïve methods that do not handle censoring. The Cox model is applied to a single censored time event while the other event is ignored. Of the two naïve methods, the former conducts the tests only on samples with non-censored data, and the latter simply ignores the censoring present in the data. CTIVA exhibits average AUC values of 0.93, 0.94, and 0.93 in the case of categorical variables with ANOVA, the permutation test, and the rank correlation test, respectively, while the Cox model and the two naïve methods exhibit AUC values lower than 0.9. Additionally, the average AUC values of CTIVA in the combined variable case are 0.91, 0.91, and 0.90, which are higher than those of the compared methods. An additive exponential distribution is used as the sampling distribution in this experiment. The average AUC values of CTIVA are observed to be 4.5, 5.6, and 8.1% higher than those of the most competitive alternative on the categorical dataset, and 5.8, 5.8, and 6.7% higher on the combined dataset. The detailed comparison results are presented in Table 2.
The box plots of the AUC values obtained via repeated simulation on the categorical and combined datasets using different sampling distributions are depicted in Figs 1 and 2, respectively. The constants in Fig 2 denote the power of the additive exponential of the sampling distribution. Evidently, the proposed CTIVA method outperforms traditional methods in terms of both the average and the range of AUC values on simulation datasets under various conditions.
Furthermore, at a cutoff p-value of 0.05, the proposed CTIVA showed favorable results in both sensitivity and specificity compared to the other baseline methods. With the cutoff p-value of 0.05 on the categorical dataset, the average sensitivity of CTIVA is 0.96 while the average specificity is 0.84. On the combined dataset, the average sensitivity of CTIVA is 0.84 while the average specificity is 0.73. The proposed CTIVA consistently outperformed the other benchmark methods in terms of sensitivity or specificity. Additionally, the proposed method demonstrated robust performance across varying p-value thresholds in the sensitivity analysis, in which the thresholds of 0.1 and 0.01 were additionally examined. With a cutoff p-value of 0.01, the average sensitivity of CTIVA is 0.89 and 0.72 on the categorical and combined datasets, respectively. The average specificity of CTIVA is 0.89 and 0.90 on the categorical and combined datasets, respectively. The detailed comparison results are provided in S1 and S2 Tables.
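For a fixed cutoff, sensitivity and specificity follow directly from the p-values and the known labels of the simulated variables. The sketch below is a minimal illustration; the function name `sensitivity_specificity` and the strict comparison `p < cutoff` are assumptions.

```python
def sensitivity_specificity(p_values, is_correlated, cutoff=0.05):
    """Sensitivity and specificity when variables with p < cutoff are detected."""
    tp = fn = tn = fp = 0
    for p, truth in zip(p_values, is_correlated):
        detected = p < cutoff
        if truth and detected:
            tp += 1   # correlated variable, detected
        elif truth:
            fn += 1   # correlated variable, missed
        elif detected:
            fp += 1   # uncorrelated variable, wrongly detected
        else:
            tn += 1   # uncorrelated variable, correctly rejected
    return tp / (tp + fn), tn / (tn + fp)

p = [0.2, 0.01, 0.03, 0.5]
truth = [True, True, False, False]
print(sensitivity_specificity(p, truth, cutoff=0.05))  # (0.5, 0.5)
```

Lowering the cutoff makes detection stricter, which generally trades sensitivity for specificity.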
The average AUC values of the CTIVA model are higher and the variances are lower than those of the compared methods, which implies the stability of the proposed method under a changing threshold. Moreover, the proposed method demonstrated favorable results at the commonly used cutoff p-value compared to the baseline methods. In conclusion, the proposed CTIVA method showed superior performance both at a fixed cutoff value and under a changing threshold.

Simulation results in correlated dataset
Although the proposed method showed promising results on the uncorrelated simulation dataset, variables in real-world data, such as gene expression data, are usually correlated with one another. Therefore, the proposed method should also be evaluated on a self-correlated dataset to show its effectiveness on real-world problems. The self-correlated datasets used in the experiment were generated through the following process. Among the 1,000 categorical variables generated through the aforementioned procedure, 10 groups, each composed of 20 variables, were randomly selected. The variables in the same group were correlated by adding the same white Gaussian noise. The same statistical tests performed on the independent categorical dataset were executed on the self-correlated dataset. Both the ROC curve and AUC values are acquired to evaluate the method, as in the uncorrelated simulation setting.
The proposed CTIVA method also shows excellent prediction performance on the correlated categorical dataset. The performance of CTIVA is compared with the same baseline methods as in the uncorrelated simulation setting. CTIVA exhibits average AUC values of 0.93, 0.94, and 0.93 in the case of categorical variables with ANOVA, the permutation test, and the rank correlation test, respectively, while the Cox survival analysis model and the two naïve methods exhibit AUC values lower than 0.9. The average AUC values of CTIVA are observed to be 4.5, 5.6, and 8.1% higher than those of the most competitive alternative considered in the experiment on the categorical dataset. The detailed comparison results are presented in Table 3.
In addition, experiments on real-world problems are presented in the next section to validate the effectiveness of the proposed method.

Real dataset results
The results presented in the previous subsections demonstrate that the proposed CTIVA method performs well on simulation problems. In this subsection, its performance is evaluated on the National Sample Cohort Demo (NSCD) dataset, which is a real-world medical history demo dataset provided by the National Health Insurance Sharing Service (NHISS) [21,22].
NSCD contains the medical history of 1,000 randomly selected Korean healthcare beneficiaries sampled between 2002 and 2015. The proposed method is applied to the NSCD dataset to identify significant medical record variables correlated with the time interval between the diagnosis of carcinoma and death. Five medical record variables (FORM_CD, MCARE_SUBJ_CD, OFIJ_TYPE, OPRTN_YN, and MCARE_RSLT_TYPE) are extracted from the healthcare statement data to test the correlation between the medical record variables and the censored time interval. Of these, MCARE_SUBJ_CD and MCARE_RSLT_TYPE are estimated to be correlated with the interval time when tested using CTIVA-ANOVA, because the p-values of the two variables are lower than 0.05. A detailed explanation of the medical record variables and descriptions of the test results are presented in Table 4.
When the diagnosis of carcinoma is considered to be event 1 and the death of the beneficiary is considered to be event 2, the Medical Diagnosis Code (MCARE_SUBJ_CD) and Medical Care Result Code (MCARE_RSLT_TYPE) are estimated to be statistically significantly associated with the interval time. The MCARE_SUBJ_CD and MCARE_RSLT_TYPE codes seem to be correlated with both events 1 and 2 because they are the diagnosis and result codes of the beneficiary during the visit [23][24][25].
Furthermore, the proposed CTIVA is tested on the proteasome inhibitor bortezomib dataset, an open-access dataset provided by the National Center for Biotechnology Information (NCBI) [26]. This dataset relates the outcomes of patients in clinical trials of the proteasome inhibitor bortezomib to gene expression profiling. Among the large number of gene expression variables and medical categorical variables in the dataset, CTIVA is applied to find the correlation between the medical categorical variables and the time interval between the pharmacogenomics (PGx) progression date and death. Although CTIVA covers both categorical and continuous variables, we concentrate on the medical categorical variables, which censored-data survival analysis usually does not cover.
Four medical categorical variables (age, sex, PGx response type, and number of prior lines) are taken from the proteasome inhibitor bortezomib dataset. The age of the patient is categorized as under 65 or not, to check whether the patient is in senescence. The sex of the patient is categorized as male or female. The PGx response type of the patient is categorized into six classes in the dataset. The number of prior lines indicates the number of distinct treatments a patient has received for the cancer, and it ranges from one to four. Among these medical categorical variables, the number of prior lines is known to be directly related to the survival time of patients treated with the proteasome inhibitor bortezomib, which indicates a correlation between the number of prior lines and the interval time between the PGx progression date and death [27]. The proposed CTIVA found the number of prior lines to be significantly correlated with the interval time, at a p-value of 0.001 with the ANOVA test. The other three variables are found not to be correlated, showing p-values higher than 0.05, which corresponds with traditional studies [27]. The graphical description of the above experimental result is depicted in Fig 3.

Discussion and conclusion
Overall, the proposed CTIVA method exhibits excellent detection performance for variables that are correlated with the interval time of two events on both simulation data and real-world problems. In particular, the proposed method outperforms traditional statistical methods in capturing correlated variables when multiple censoring is present. By combining probability density function estimation with Monte Carlo time interval estimation, CTIVA achieves an average AUC value of 0.93 or higher on the categorical simulation data, outperforming conventional benchmark methods. CTIVA also performed well on real datasets, namely the NSCD and NCBI datasets, in identifying categorical variables correlated with the interval time of two clinical events. Further, CTIVA effectively identified correlated variables on a combined dataset comprising both categorical and continuous variables. Although applying an identical p-value threshold to different statistical tests on categorical and continuous variables might not be mathematically justified, the results of the proposed method on simulated data are superior to those of the naïve methods, with an average AUC value of 0.90 or higher in capturing correlated variables. Thus, the experimental results corroborate the effectiveness of such an approach in handling combined variable censored time-to-event datasets.
Moreover, CTIVA exhibits excellent detection of correlated variables on a real-world medical history dataset and a clinical dataset with gene expression analysis. Among the five categorical medical record variables in the NSCD medical history dataset, the proposed method detects two variables correlated with the interval time between the diagnosis of carcinoma and death. The two detected variables are clinically known to be correlated with the time interval between the two events. Additionally, among the four categorical variables in the NCBI dataset, CTIVA identified the number of prior lines as correlated with the interval time between the PGx progression date and death. The number of prior lines of the patient is clinically shown to be correlated with the interval time of the two events. Similar conclusions are also demonstrated in [27].
Although CTIVA showed promising results in detecting correlated variables, some limitations exist in the proposed method. Since the proposed method estimates the joint probability distribution with Bayesian estimation, a disparity between the actual distribution and the estimated distribution might exist and influence the prediction results. Also, the probability estimation of the proposed CTIVA relies on the convergence of the density estimation function. Therefore, the computational cost of the proposed method might increase relative to conventional benchmark methods when a more accurate estimation of the censored data is required.
In conclusion, the proposed CTIVA method is verified to be capable of effectively handling combined multivariate censored time-to-event datasets, which are the dominant type of censored dataset in real-world problems. Despite some inherent limitations of the proposed method, we believe that the development of CTIVA is a milestone in research on handling combined multivariate censored time-to-event problems owing to its excellent performance on both simulation data and real-world problems. Further, we believe it could contribute to real-world multivariate censored time-to-event problems with multiple censorings, such as estimating the expected survival time of carcinoma patients.

Fig 1. Box plot of AUC in predicting categorical variables with the ANOVA test. CTIVA indicates the proposed method, while the other methods are baseline methods depicted for comparison. Cox1 and Cox2 denote the AUC results of the Cox survival analysis model applied to censored time events 1 and 2, respectively. IGN denotes the baseline method that ignores the censoring, and NC denotes the baseline method that only handles the non-censored data. The data of the experimental results were sampled from (a) an additive exponential distribution, (b) the log-normal distribution, and (c) the Clayton-Oakes model. https://doi.org/10.1371/journal.pone.0294513.g001

From the observed censored dataset, the proposed CTIVA estimates the actual event times and captures the correlated variables through the ANOVA test. The data in which both events are censored are shown as a scatter plot in Fig 3(A). The box plots in Fig 3(B) and 3(C) show the interval times of the CTIVA-estimated data and of the raw censored data for these observations. The horizontal axis of the box plots indicates the number of prior lines, the variable that is known to be correlated with the interval time. The results shown in Fig 3 indicate that patients with fewer prior lines tend to survive longer, which corresponds with traditional studies [27].

Fig 3. Graphical description of CTIVA with the NCBI dataset. Observations in which both the PGx progression date and death are censored are used in the experiments. The triangle marks in (a) indicate the raw censored data, while the dot marks indicate the data estimated with CTIVA. The p-values acquired through the ANOVA test on the raw censored data and the CTIVA-estimated data are given in the titles of (b) and (c). The interval times between the PGx progression date and death for each number of prior lines are shown in box plots (b) and (c). https://doi.org/10.1371/journal.pone.0294513.g003

Table 2. The AUC comparison results of the proposed CTIVA and other baseline models on the uncorrelated dataset.
The dataset was sampled from an additive exponential distribution and evaluated with different statistical tests. https://doi.org/10.1371/journal.pone.0294513.t002

Table 4. Description of the NSCD dataset and CTIVA-ANOVA analysis results.
The overall results on the proteasome inhibitor bortezomib dataset can be obtained by running the source code provided in the GitHub repository at https://github.com/Insoo-K/CTIVA.